Support moving theta and models to a specific device. #9

Merged 2 commits on Apr 25, 2024

@stellaraccident commented Apr 25, 2024

  • Threads an explicit device through the models.
  • Implements functional InferenceTensor, Theta and Dataset transformations and uses them to implement `to(device=)`.
  • Adds `--device foo` to the example runner.
  • With iree-org/iree-turbine#3 ("Add device bridge support for HIP and CUDA") and supporting patches, this allows custom ops and kernels to be used transparently on CUDA/ROCm devices (instead of just CPU).

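
As a rough illustration of the "functional transformation" idea in the list above, here is a minimal, self-contained sketch. It does not use the project's real classes: `ToyTensor`, `ToyTheta`, `transform` and `parse_device` are hypothetical names invented for this example, and the device is modeled as a plain string rather than a real device object. It only shows how a `to(device=)` method can be expressed as one tree-wide transformation that returns a new container, and how a `--device` flag might feed into it.

```python
import argparse
from dataclasses import dataclass, replace
from typing import Callable


@dataclass(frozen=True)
class ToyTensor:
    """Hypothetical stand-in for an inference tensor: a name plus a device tag."""
    name: str
    device: str = "cpu"


class ToyTheta:
    """Hypothetical nested parameter container (not the project's real Theta)."""

    def __init__(self, tree: dict):
        self.tree = tree

    def transform(self, fn: Callable[[ToyTensor], ToyTensor]) -> "ToyTheta":
        # Functional style: build a new tree and leave the original untouched.
        def walk(node):
            if isinstance(node, ToyTensor):
                return fn(node)
            return {key: walk(child) for key, child in node.items()}

        return ToyTheta(walk(self.tree))

    def to(self, *, device: str) -> "ToyTheta":
        # Moving to a device is just one particular transformation.
        return self.transform(lambda t: replace(t, device=device))


def parse_device(argv: list) -> str:
    # Assumed shape of a `--device` flag; the default value is a guess.
    parser = argparse.ArgumentParser()
    parser.add_argument("--device", default="cpu")
    return parser.parse_args(argv).device


theta = ToyTheta({"attn": {"wq": ToyTensor("attn.wq")}, "norm": ToyTensor("norm")})
moved = theta.to(device=parse_device(["--device", "cuda:0"]))
```

Because `transform` returns a fresh container, `theta` still refers to the CPU-tagged tensors after the call, while `moved` holds the retagged copies. Other tree-wide operations could reuse the same `transform` hook in this sketch.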
@stellaraccident merged commit a50e231 into main on Apr 25, 2024
2 checks passed
@stellaraccident deleted the explicit_device branch on April 25, 2024 at 03:27